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Abstract 

We address the problem of determining a natural local neighbourhood 
or "cluster" associated to a given seed vertex in an undirected graph. We 
formulate the task in terms of absorption times of random walks from 
other vertices to the vertex of interest, and observe that these times are 
well approximated by the components of the principal eigenvector of the 
corresponding fundamental matrix of the graph's adjacency matrix. We 
further present a locally computable gradient-descent method to estimate 
this Dirichlet-Fiedler vector, based on minimising the respective Rayleigh 
quotient. Experimental evaluation shows that the approximations behave 
well and yield well-defined local clusters. 
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1 Introduction and motivation 

1.1 Nonuniform networks 

The field of natural-network study became popular when Watts and Strogatz 
(1) published their observations on the short average path length and the high 
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Figure 1: A caveman graph (13) composed of six near-cliques of five vertices 
each that have been connected into a circulant graph by "opening" one edge 
from each clique (the removed edge is shown with a dotted line). 

clustering coefficient of many natural graphs, followed by the observations of 
scale-free distributions (2; 3) in the degrees and other structural properties of 
such networks. As a consequence of the resulting wide interest in the properties 
of natural networks, there now exist numerous models to meet the observations 
made on natural networks (4; 5; 6). 

1.2 Graph clustering 

One of the properties of interest in the field of natural graphs is the presence of 
clusters or communities (7), that is, the existence of dense induced subgraphs 
that have relatively few connections outside compared to the internal density 
(8). 

Graph clustering is the task of grouping the vertices of the graph into clusters 
taking into consideration the edge structure of the graph in such a way that there 
should be many edges within each cluster and relatively few between the clusters. 
For an artificial example, see Figure 1 that illustrates a small graph with a 
clear six-cluster structure. Another classic example is a small real-world social 
network studied by Zachary (9) and often referred to in graph clustering papers 
(5; 10; 11). It is a social network of a small karate club that was just about 
to split into two (see Figure 2), making it an ideal case for two-classification 
algorithms. For a survey on graph-clustering algorithms, sec (12). 

1.3 Local clustering 

In local clustering, the goal is to find the cluster of a given seed vertex s £ V. 
Hence, essentially, it is the task of finding a bipartition of the graph G into 
two vertex sets S and V \ S such that s£S and S makes a good cluster in 
some predefined sense. Common cluster quality criteria include cut capacity 
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Figure 2: The karate club social network studied by Zachary (9). The two groups 
into which the club split are indicated by the shape with which the vertices are 
drawn: the squares later formed their own club, and the circles formed another 



and related measures such as conductance (14) or density-based measures (15). 
Also methods motivated by electric networks have been proposed for global and 
local clustering alike (10; 11; 16). 

1.4 Spectra of graphs 

Let G = (V,E) be an unweighted undirected connected graph with at least 
two vertices. For simplicity, we focus on unweighted graphs, although much of 
what follows can easily be generalised to the weighted case. Denote the order 
of G, i.e. its number of vertices, by n and identify each vertex v with a label in 
{1,2,..., n}. Denote the seed vertex by s. The adjacency matrix of G is the 
binary matrix A, where aij — 1 if edge {i,j} is in E, and otherwise ciij = 0. 

For a weighted graph, one would consider instead the analogous edge-weight 
matrix. Note also that for multigraphs, edge multiplicities can in the present 
context be considered simply as integer weights. For an undirected graph, the 
adjacency (resp. edge weight) matrix is symmetric, whereas directed graphs pose 
further complications in the algebraic manipulation — we refer the reader to 
the textbook and other works of Chung (17; 18; 19; 20) for properties and local 
clustering of directed graphs. 

The degree d v of a vertex v is the number (resp. total weight) of its incident 
edges; thus the components of the degree vector d of G are the row sums of A. 
Denote by D the diagonal n x n matrix formed by setting the diagonal elements 
to da = di and all other elements to zero. 
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Let I be the nxn unit matrix. The Laplacian matrix of G is L = D — A and 
the normalised Laplacian matrix of G is C = D ^LD ^ = I D"5AD"5. 
Since both L and C are symmetric, all their eigenvalues are real. It turns 
out that C is in some respects a more natural object of study than L, and 
we shall mostly focus on that. It is easy to see that zero is an eigenvalue of 
both L and £, and for C it can be shown that all the other n — 1 eigenvalues 
(counting multiplicities) lie in the interval [0,2]. Denote these in increasing 
order as = ^ < /ii < • • • < n n -\ < 2, and let be some right eigenvector 
associated to /Zj. We may assume that the distinct eigenvectors are orthogonal 
to each other. For more information on the spectral and algebraic properties of 
graphs, see e.g. the excellent monographs of Biggs (21) and Chung (17). 

1.5 Random walks 

The simple random walk on a graph G is a Markov chain where each vertex 
v E V corresponds to a state and the transition probability from state i to state 
j is pij = df~ if {i,j} G E and zero otherwise. For a weighted graph, is the 
ratio of the weight of edge {i,j} to the total weight of edges incident to i. 

Denote the transition probability matrix of this Markov chain by P = D 1 A. 
Note that even for undirected graphs, P is not in general symmetric. However, 
it is similar to the matrix 

V = D3PD~3 = D ^AD 2 (i) 

which is symmetric because A is the adjacency matrix of an undirected graph. 
Thus, P and V have the same spectrum of eigenvalues, which are all real. 
Moreover, 

C = D 5LD -5 = D "t(D A)D 3 

= D "2(D DP)D = I D^PD 3 (2) 
= I-V. 

Consequently, A is an eigenvalue of the normalised transition matrix V if and 
only if [i = 1 — A is an eigenvalue of the normalised Laplacian matrix L. Thus, P, 
V and C have the following correspondence: v is a right eigenvector associated 
to eigenvalue A in P if and only if u = v is a right eigenvector associated to 
the same eigenvalue in V, and to eigenvalue 1 — A in C. 

Since in the case of Markov chains, left eigenvectors are also of interest, let 
us note in passing that the analogous correspondence holds between each left 
eigenvector tt of P and left eigenvector p = 7rD~2 of V or C. 

Denote the eigenvalues of P in decreasing order as Aq ' > X\ > ■ ■ > A„_' x . 

(PI 

Since P is a stochastic matrix, it always has eigenvalue AJ, = 1, corresponding 

( c) 

to the smallest Laplacian eigenvalue /Uq = 0. All the other eigenvalues of P 
satisfy |A| P) | < 1. If moreover G is connected and not bipartite, the Markov 
chain determined by P is ergodic, in which case | A| P) I < 1 for alH > 1. Without 

much loss of generality, we shall assume this condition, and moreover that all 
(p) 

the eigenvalues A- are nonnegative. Both of these conditions can be enforced 
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by considering, if necessary, instead of P the "lazy random walk" with transition 
matrix 

P' = i(I + P). (3) 

For a connected graph G this chain is ergodic, and has nonnegative eigenvalues 



A 



(p') 



1 



(4) 



with the same eigenvectors as P. 

Let us then consider a transition matrix P obtained from P by making a 
given state, or vertex s absorbing. Thus, P is otherwise equal to P, but all 
p S i = except for p ss = 1. We shall henceforth assume, for simplicity of 
notation, that s — n, so that in particular P has the block structure: 



Q 



Vo-.-o 1 J 



Pi \ 



Pn-1 



(5) 



The absorption time nii from vertex i ^ s to the seed vertex s is the expected 
number of steps that a walk initiated at i will take before hitting s. Intuitively, 
as the absorption time measures in a certain sense the proximity of vertex i to 
vertex s, vertices belonging to a good cluster S for s, if such a cluster exists, 
should have characteristically smaller absorption times to s than vertices in 
V \ S. Note that not all graphs exhibit a clustered structure, in which case no 
clustering method will be able to pinpoint a high-quality cluster (12). 

It is well known that the absorption times to vertex s — n can be calculated 
as row sums 

mi = m it i +m i;2 + . . . + m ii „_i. (6) 
from the fundamental matrix 



M = I + Q + Q 2 + Q 3 + 



(I-Q)- 



(7) 



where Q is the matrix obtained from P (or cquivalcntly from P) by eliminat- 
ing the row and column corresponding to vertex s = n (as shown above in 
Equation (5)), 

In Figure 3, we illustrate the absorption times in the caveman graph of Fig- 
ure 1: we computed with Matlab the absorption times from all vertices to a 
given seed vertex j, repeated the computation for each j G V, and formed a 
matrix where each column represents the absorption-time vector for the corre- 
sponding vertex j. The columns are ordered so that all absorption-time vectors 
associated to a given cave are grouped together, before those of the next cave, 
and so forth. The matrix is visualised as a gray-scale colour map by plac- 
ing a tiny black square where either niij = (that is, along the diagonal) or 
niij — 10.6 (the minimal off-diagonal absorption time observed), a white square 
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Figure 3: The absorption time matrix composed of 30 absorption-time vectors 
using each vertex of the caveman graph of Figure 1 in turn as a seed vertex, 
with white corresponding to the maximum rrijj thus obtained and black corre- 
sponding to the minimum mjj and the diagonal zeroes. 

where mjj = 319.6 (the maximum observed), and discrctising the intermediate 
values to 254 gray-scale colours correspondingly. The caves can be distinguished 
as dark five-by-five blocks along the diagonal, although the matrix is somewhat 
too noisy to be trivially clustered. 

Now consider the eigenvalue spectra of matrices P and Q. Matrix P is still 
stochastic, so it has largest eigenvalue A = 1, and since the chain is absorbing, 
all the other eigenvalues satisfy |A^| <l,i = l,...,n— 1. 

Denote Q = D3QD~^, where D = diag(<ii, . . . , d n -i). As Q is symmetric 
(it is obtained by eliminating the last row and column from the symmetric 
matrix V) and Q is similar to Q, both have a spectrum of real eigenvalues 
Spec(Q) = {X^ > ■ ■ ■ > A^\}. This spectrum is properly contained in the 
interval [—1,1], because for any vertex j/n adjacent to n, pi n > 0, and so the 
i th row sum of Q is less than 1. 

We claim that in fact 

Spec(Q) = Spcc(P) \ {1}. (8) 

To prove this claim, let namely A =/= 1 be any non-principal eigenvalue of P and 
v a corresponding eigenvector, so that Pv = Av. Since the n th row of P is zero 
except for p nn = 1, it follows that Xv n = (Pv)„ = v n , and since A ^ 1 that 
necessarily v n = 0. Then for the (n — l)-dimensional vector v' = (v\, . . . , v n -\) 
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and for any % = 1, . . . , n — 1 it holds that: 

n—1 n—1 n 

(Qv')i = y^Pij^j = y^PijVj = ^PijVj -PinVn 

j=i j=i i=i (9) 

= (Pv)j - U„Pi„ = (Pv)j 

= A«i = Xv[. 

Consequently, v' is an eigenvector associated to eigenvalue A of Q. Since A was 
chosen arbitrarily from Spec(P) \ {1}, this establishes that Spec(P) \ {1} C 
Spec(Q). For the converse direction, a similar argument shows that if v' = 
(vi,. .. ,v n -i) is an eigenvector associated to an eigenvalue A of Q, then the 
vector v = (v\, . . . , v n -\, 0) is an eigenvector associated to eigenvalue A of P. 

2 Spectral methods for bipartitioning 

2.1 Fiedler vectors 

Spectral clustering of points in space, often modelled as (complete) weighted 
graphs, is a widely studied topic (22; 23). In the context of graphs, the technique 
is usually applied so that some right eigenvector associated to the smallest 
nonzero eigenvalue /i^ of L is used to produce a bipartitioning of the graph 
such that those vertices that have negative values in the eigenvector form one 
side of the bipartition S and the vertices with positive values are the other side 
S\V. These eigenvectors are called Fiedler vectors following (24; 25), where the 
technique was first proposed. The corresponding eigenvectors based on C are 
called normalised Fiedler vectors. The works on Fiedler-vector based spectral 
clustering are numerous and go back for decades (26; 27; 28) 

For our example graph illustrated in Figure 1, such a bipartition based on L 
puts three of the caves in S such that it assigns negative values to every other 
cave along the cycle of six caves. Using the eigenvector of C, however, assigns 
only negative values in the vector and does not yield an intuitive division that 
preserves the caves. The two vectors are visualised in Figure 4. 

If there are only two natural clusters in the graph, such bipartition works 
nicely. An example is the Zachary karate club network of Figure 2: the corre- 
sponding Fiedler vectors are shown in Figure 5. Also, recursively performing 
bipartitions on the subgraphs induced by S and V\S will help cluster the input 
graph G in more than two clusters, but a stopping condition needs to be imposed 
to determine when to stop bipartitioning the resulting subgraphs further. 

2.2 Spectral partitioning as integer program relaxation 

The use of Fiedler vectors for graph bipartitioning can be motivated as follows 
(see for example (22)). Denote a cut (bipartition) of a graph G = (V,E) into 
vertex sets S and S = V \ S as (S, S). The capacity of a cut (S, S) is defined as 

C(S,S) = \{{i,j}eE:ieS,jeS}\. (10) 
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Figure 4: The components of the Fiedler vector (left) and the normalised Fiedler 
vector (right) for the caveman graph of Figure 1. For the human eye, the six- 
cluster structure is evident in the Fiedler vectors, whereas in the normalised 
Fiedler vector the vertices are grouped into four clusters (two of them consisting 
of two caves) . 



Fiedler vector Normalized Fiedler vector 




Vertices of the graph (cave by cave) Vertices of the graph (cave by cave) 



Figure 5: The components of the Fiedler vector (left) and the normalised Fiedler 
vector (right) for the karate club graph of Figure 2. The vertices can be classified 
in two groups: those with positive values in the Fiedler vector and those with 
negative values. 
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A cut (S, S) can be conveniently represented by an indicator vector v e {+1, — l}* 1 
where Vi = +1 if i 6 S, and Vi = — 1 if i € 5. 
Then 

= J5><- V *) 2 ' (11) 

where the sum is over all the (undirected) edges {i, j} E E. 

For simplicity, assume now that \V\ = n is even, and consider the task of 
finding an optimal bisection of G, i.e. a cut (S, S) that satisfies \S\ = \S\ = n/2 
and minimises C(S, S) subject to this condition. 

This is equivalent to finding an indicator vector v 6 {+1, —1}™ that satisfies 
Ei w « = and minimises the quadratic form J2i~j( v i ~ w i) 2 j or cquivalcntly 
(since n is fixed) minimises the ratio: 



n/4 



2 



Since the all-ones vector 1 is associated to the eigenvalue /j,^ = 0, we have by 
the Courant-Fischer characterisation of the smallest nonzero eigenvalue : 



(L) . v T £v . Ei~>i-"j) 2 

a) = mm = mm ^ — 5 , (12 

v_li v T v E,»i=o Ei^f 



where the minimum is taken over all vectors v^O satisfying the given condition. 
Since we can without loss of generality also constrain the minimisation to, say, 
the vectors of norm ||v|| 2 = n, we sec that the task of finding a Fiedler vector of 
G is in fact a fractional relaxation of the combinatorial problem of determining 
an optimal bisection of G. 

This correspondence motivates the previously indicated spectral approach 
to bisectioning a connected graph G (24; 29): 

1 . Compute Fiedler vector v G R" of G. 

2. Determine cut (S, S) by rule: 

Vi > 9 => ieS, 



Vi<6 => ieS, ^ 

where 9 is the median value of the Vi's. 

The use of normalised Fiedler vectors to graph bipartitioning was explored 
in (30), where it was shown that Fiedler vectors of C yield fractionally optimal 
graph bipartitions according to the normalised cut capacity measure: 

6 ^ = Wt + Wt- (14) 
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where Vol(S) = Y^ies d i- 

Since u is an eigenvector of £ with eigenvalue A if and only if v = D~2 u is 
eigenvector of D _1 L with eigenvalue A, the eigenvalue (j,^ can be characterised 
in terms of a "degree-adjusted" Rayleigh quotient: 

(£) . u^Cu . Ei~>i-"i) 2 
u = mm = mm =^ — - — = . (15) 

Since u is an eigenvector of £ with eigenvalue A if and only if v = D~"?u is 
eigenvector of D _1 L with eigenvalue A, the eigenvalue fj,^ can be characterised 
in terms of a "degree-adjusted" Rayleigh quotient: 

(£) . u T £u . T^^H-Vj? 
^ = mm — — = mm — . (16) 



ulDIl 



U T U vlDl J2i divf 



A natural extension of the spectral clustering idea to the local clustering 
context is to consider the Laplacian L or C together with the Dirichlet boundary 
condition that only clustering vectors v with the seed vertex v s fixed to some 
particular value are acceptable solutions. 

We follow (17; 31) in using the normalised Laplacian C and choosing v s = 0, 
or equivalently u s = (Di v) s = as the boundary condition. We thus aim to 
cluster according to the "Dirichlet-Fiedler vector" minimising the constrained 
Rayleigh quotient: 

. u T £u . £i~j("i-«j) 2 
mm = mm =^ — —5 . (17) 

u:« s =0 U T U v.v 3 =0 l^i "iVf 

For notational simplicity, assume again that s = n, and observe that for every 
vector u = (ui, . . . , u n -i, 0), the value of the Rayleigh quotient in equation (17) 
is the same as the value of the (n — l)-dimensional quotient with respect to 
vector u' = (u\, . . . ,u n -i) and Laplacian £ which equals C with its n th row 
and column removed. Thus, our clustering vector v is, except for the final zero, 
the one minimising: 

. (u') T C'u' . T,i~j( v 'i ~ v 'j) 2 (m , 

ml , n ( a t , = ml , n ~T~TT\2 ' ( 18 ) 

u' (u') U' V 22i d i( v i) 

i.e. v' = D"5u' for the principal eigenvector u' of the Laplacian £'. Let us 
denote v = v-^ and call this the local Fiedler vector associated to graph G and 
seed vertex s = n. 



3 Local Fiedler vectors and absorption times of 
random walks 

We shall now show that the components of the local Fiedler vector v' = 
(«!,... , v n -\) are in fact approximately proportional to the absorption times 
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vrii discussed in Section 1.5. The connection between the absorption time pro- 
vides a natural interpretation to the notion of the local Fiedler vector, and 
yields further support to the idea of local clustering by constrained spectral 
techniques. Previously random walks and spectral clustering have been jointly 
addressed by Meila and Shi (32) and local clustering by PageRank by Andersen, 
Chung, and Lang (18). Important papers linking structural properties of graphs 
to convergence rates of random walks via spectral techniques are (33; 34). 
Observe first, from equation (2), that: 

£' = I-DJQD-5=I-Q, (19) 

where Q is as in Equation (5) and D = diag(c?i, . . . , d n -i). 
Since Q is similar to Q, its spectrum satisfies: 

Spec(Q) = Spec(Q) = Spcc(P) \ {1}. (20) 

Thus, /j ^ is an eigenvalue of CI if and only if A = 1 — /j ^ 1 is an eigenvalue 
of both Q and Q. Moreover, if u is an eigenvector associated to eigenvalue A in 
Q, then v = D"5u is an eigenvector associated to the same eigenvalue in Q. 

Let then the eigenvalues of Q (or equivalcntly Q) be 1 > Ai > • • • > \ n -i > 
0. Since Q is symmetric, it has a corresponding orthonormal system of eigen- 
vectors ui, . . . , u„_i and a representation: 

n-l 

Q = ^X i u i u i T . (21) 

i=l 

Denoting the component matrices = u^u^ , we observe that by orthogonality 
of the eigenvectors we have XJiXJj — for i ^ j, and by normality Uf = U^. 
From these two observations it follows that: 

n-l 

Q* = ^A*U i; fort = 0,1,... (22) 

i=l 

Since Q = D ~i QD^ , we obtain from this for Q* the representation: 

71—1 n—1 

Q* = D"5Q*D5 =^A?(D-5u i )(u i T D^) = ^A^v i T D, (23) 

i=l i=l 

where = T)~^vn is an eigenvector associated to eigenvalue A, in Q. 

Substituting this to Equation (7) and denoting the (n — l)-dimensional all- 
ones vector by 1, we thus obtain an expression for the vector m of absorption 
times m, in terms of the eigenvalues and eigenvectors of Q, or equivalcntly Q: 

oo 

m = ^Q'l 

t=0 

oo n — 1 

= EE A ' v * v « TD1 ( 24 ) 

t = Q 2=1 
OO / 71 — 1 \ 

= EEW* 

t=0 \i=l / 
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where d = (di, . . . , rf„_i) T . 

Now if the principal eigenvalue Ai is well-separated from the others, i.e. if 
the ratio | A»/Ai | is small for i > 1, this yields a good approximation for m: 



m = 1 + X>* 



t=i 



v lVl T d + ^ (J^J v 2 v, T d 



(25) 



\ small-norm "noise" / 

oo 

» l + ^A* lV ivi T d 

t=i 

= l + -^-v lVl T d. 

1 — Ai 

Even in cases where there is no evident gap in the spectrum and hence near- 
equality cannot be assumed, we have found in our experiments that the approx- 
imations obtained are near-perfectly correlated with the exact absorption times 
for a variety of graphs. 

We study three example graphs to point out the strengths and weaknesses 
of the proposed approximation. The first example graph is the clustered but 
highly symmetric caveman graph of Figure 1, where the symmetries present 
cause problems for the proposed approximation. Our second example is the 
karate club network shown in Figure 2. The third example graph is a uniform 
random graph G n , P , with n — 100 and p = 0.1 (35), which by definition has no 
clear cluster structure, and hence the absorption times cannot be expected to 
have interesting patterns. 

In Figure 6, we show comparisons of some approximate and exact spectral 
computations for three example graphs. In each case, the highest-numbered 
vertex of the graph has been chosen as the unique seed vertex. It can be noted, 
from the top row of plots in Figure 6, that the spectra of the graphs' P matrices 
do not exhibit large gaps between their second and third largest eigenvalues. 
Thus, it can not be expected a priori that the Fiedler- vector based approxima- 
tions to the absorption times, from Equation (25), would be even of the same 
magnitude as the exact ones, as calculated from Equations (6) and (7). (Observe 
also how the structure of the caveman graph is reflected in the corresponding 
P spectrum: a notable eigenvalue gap occurs after the six largest eigenvalues, 
each representing the dominant convergence behaviour of one of the clusters.) 

Correlations between the approximate and exact absorption times are appar- 
ent in the quantile-quantile plots presented in the second row of Figure 6: here 
the values group diagonally when a linear dependency exists. The correlation 
is very high in all cases: 0.99863 for the caveman graph, 0.99636 for the karate 
club network, and 0.99999 for the uniform random graph. 

The two lowest rows in Figure 6 present the actual values of the exact and 
approximate absorption-time vectors, indexed by vertex number. These plots 
illustrate the usefulness of these quantities for performing a two-classification 
of the vertices into the local cluster of the seed vertex (low values) versus the 
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other vertices (high values). In fact, for the caveman graph, the full six-cluster 
structure is visible. In the karate club network it can be seen that two groups 
are present: one with high values and another one with low values. (Cf. Figure 
8, which indicates the "ground truth" clustering of the vertices in this graph.) 
As expected, the uniform random graph reveals no significant cluster structure, 
but the vertices near the seed vertex can be identified by their lower values, 
whereas most of the graph has another, higher value. 

In practice, it is not always interesting to compute the absorption times 
for all vertices, especially in local computation, in which case we may only 
have approximated some of the components of the Fiedler vector. For these 
situations, we may write the k th component of the result vector as 

rt-l 

(Q'l) fc = (^A*v 4 v 4 T Dl) fe 

i=l 

n—l n—1 

= EAKv^E^Md),. 



1=1 



From this we obtain for the absorption time from vertex k to vertex s the 
expression 



TOfc 



£(Q'i) 

t=0 

oo 



n-l / , \ t 
A; 



t=l \ i=2 

oc 



(27) 



t=i 

= 1+ 1 _ 1 A •ci-(vi) fe . 



Now for a given graph G, d is a constant and so we obtain the very simple 
approximate correspondence m « 1 + c'v^ between the absorption time vector 
m and the local Fiedler vector = vi. 

In order to compare the quality of the approximation as well as to illustrate 
the computational load in approximating by summing term by term the series 
of Equation (24), we calculated for each cutoff length the sum of squares of the 
differences between the partial sums and the exact absorption times, divided 
by the order of each of the three example graphs: the graph of Figure 1, the 
Zachary karate club graph of Figure 2, and the uniform random graph Q ntP . 
The resulting values over the set of vertices are shown in Figure 7 (on the left) 



13 



Caveman graph 



Karate club network 



Uniform random graph 



CL 



x 

LU 



Q- 
Q- 
< 




200 
100 




500 1000 
Approximate 



B 200 
cd 



100 

1000 
500 




40 
30 
20 
10 




100 200 
Approximate 



40 . 

20 

- 

150 
50 



180 
160 





□ ° 

-an ^ 


as 





1500 1600 1700 1800 1900 
Approximate 



180 



160 



1800 



1600 



on n ° □ °n 



□ □□ aDD D ncE 



Vertices 



Vertices 



Vertices of the graph 



Figure 6: Comparisons of approximate and exact spectral computations for 
three example graphs: the small graphs of Figures 1 and 2, and a uniform 
random graph Q n ,p, using a random vertex as the seed vertex. The top row 
presents the sorted spectra of the P matrices of the graphs, the second row 
plots the approximate and exact absorption-time values for the given seed vertex 
against each other, and the lowest two rows indicate the exact and approximate 
absorption-time values as ordered by vertex number. The bottom rows can be 
seen as illustrating the quantities' capability of distinguishing the cluster of the 
seed vertex (low values) from the other vertices (high values). 



14 



11 



100 200 300 400 500 600 700 800 
Iteration 



1 

0.95 
0.9 

0.85 
0.8 

0.75 
0.7 

0.65 



h a □ 



■■■Ills 



Random graph 
Karate club 
Caveman graph 



20 



40 60 
Iteration 



80 



100 



Figure 7: The sum of squares of the difference from the exact absorption-time 
(on the left) of estimate vectors with different cutoff values for approximating 
through Equation (24) and Pearson correlation between the exact and the esti- 
mate vectors (on the right) for the three example graphs: the small graphs of 
Figures 1 and 2, and the Q n , P - The values shown are averaged over the vertex 
sets of the two small graphs and over a set of 30 vertices selected uniformly at 
random for the G n ,p graph. The smallest standard deviation corresponds to the 
caveman graph and the largest to the uniform random graph. The horizontal 
lines (all three overlap between 0.980 and 0.997) correspond to the average cor- 
relation coefficients between the exact and the approximate absorption times of 
Equation (25). 



together with the Pearson correlations (on the right) achieved at each iteration. 
In both plots, mean and standard deviation are shown. 

4 Local approximation of Fiedler vectors 

We take as a starting point the Rayleigh quotient of Equation (18). Since we 
are free to normalise our eventual Fiedler vector v' to any length we wish, we 
can constrain the minimisation to vectors v that satisfy, say, ||v||2 = n = \V\. 
Thus, the task becomes one of finding a vector v that satisfies for a given s EV: 

v / = argminj^^ - v k f : v s = 0, ||v||2 = n\. (28) 

We can solve this task approximately by reformulating the requirement that 
||v||| = n as a "soft constraint" with weight c > 0, and minimising the objective 
function 

/(v) = 5E( w J-**) a + r( n -S>?) ( 29 ) 
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by gradient descent. Since the partial derivatives of / have the simple form 



the descent step can be computed locally at each vertex at time t + 1, based on 
information about the values of the vector v at time t, denoted by v(t), for the 
vertex itself and its neighbours: 



where 6 > is a parameter determining the speed of the descent. 

Assuming that the natural cluster of vertex s is small compared to the order 
of the graph n, the normalisation 1 1 v 1 1 ^ = n entails that most vertices j in the 
network will have Vj « 1. Thus the descent iterations (31) can be started from 
an initial vector v(0) that has v s (0) — for the seed vertex s G V and i)fe(O) = 1 
for all k 7^ i. The estimates need then to be updated at time t > only for 
those vertices j that have at least one neighbour k such that Vk(t — 1) < 1. 

Balancing the constraint weight c against the speed of gradient descent S 
naturally requires some care. We have obtained reasonably stable results with 
the following heuristic: given an estimate k for the average degree of the vertices 
in the network, set c = 1/k and 5 = c/10. The gradient iterations (31) are then 
continued until all the changes in the w-estimates are below e = S/10. We leave 
the calibration of these parameters to future work. 

The (approximate) Fiedler values thus obtained represent proximity-values 
of the vertices in V to the cluster of vertex s. Determining a bisection into S 
and V \ S is now a one-dimensional two-classification task that can in principle 
be solved using any of the standard pattern classifiers, such as variations of the 
basic k- means algorithm (36). 

We illustrate the applicability approximate absorption times for clustering 
the karate club network (Figure 2). The approximate absorption times shown in 
Figure 8 are computed directly with Equation (27): the group structure is seen 
to be strong when the seed vertex is one of the central members of the group, 
whereas the classification task is harder for the "border" vertices, as can be 
expected. For more extensive examples of clustering with the locally computed 
approximates, we refer the reader to previous work (11). 

5 Conclusions and further work 

In this work we have derived an expression for the absorption times to a single 
absorbing vertex s in a simple random walk in an undirected, unweighted graph 
in terms of the spectrum of the normalised Laplacian matrix of the graph. We 
have shown that by only knowing the Fiedler vector corresponding to s on the 
boundary and the corresponding eigenvalue provides an approximation of the 




(30) 



Vj(t + 1) = vj(t) + S ■ [Y^Vk ~ (dj - c) • vj 




(31) 
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Figure 8: Four examples of two-classifying vertices of the Zachary karate club 
graph. The examples on the left have the seed vertex among the "rectangles" of 
Figure 2 and the examples of the right have the seed vertex among the "circles" . 
The vertices are ordered by their label in Figure 2 and a zero has been inserted 
to represent the absorption time to the seed vertex itself. The group in which 
the seed belongs is drawn in black and the other group in white. 



absorption times if the spectrum of the graph presents a gap after the first 
eigenvalue. Experimentally we have confirmed that the values given by the 
approximation are nearly perfectly correlated with the exact absorption times 
even in the absence of such a gap. 

Our motivation is to use the absorption times into a seed vertex s as a 
measure of proximity in two-classifying the graph into two partitions: vertices 
that are "relevant" to the seed vertex and other vertices. Hence, not knowing the 
exact values but rather another vector of perfectly correlated values is sufficient 
for separating between the vertices with higher values from those with lower 
values (which is the classical two-classification task). 

Such a two-partition of a graph is known as local clustering. In order for the 
proposed values to be locally computable, we have also presented a gradient- 
descent method to approximate the Fiedler vector using only local information 
in the graph. The method iteratively processes the neighbourhoods of vertices 
starting from the seed vertex and expanding outwards within the group of po- 
tentially "relevant" vertices, without any need to process other parts of the 
graph. We have illustrated the potential of these vectors in two-classification 
for local clustering on a classical example graph representing a social network. 

In further work, we seek to study further the effects of the presence or ab- 
sence of a spectral gap in the input graph into the approximation proposed. We 
also want to calibrate the parameters of the locally computable approximation 
in such a way that no a priori knowledge of the input graph would be needed, 
but that the method would rather adapt to the structure of the graph at run- 
time by dynamic parameter adjustment. Of additional interest are extensions 
of this work to weighted and directed graphs as well as case studies of appli- 
cations of local clustering. We also contemplate possible uses for approximate 
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absorption times in resolving other problems of interest that involve complex 
systems represented as graphs. 
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